Concept
Big data
Variants
Big Data Science
Parents
High Performance ComputingAstrodynamicsBig Data AnalyticsQuality Management SystemsConstruction Automation
48K
Publications
3M
Citations
113K
Authors
11.2K
Institutions
Warehouse-to-Stream Analytics
1996 - 2004
This period fused enterprise data warehousing with emerging streaming and approximation methods, yielding predictable Online Analytical Processing (OLAP) via materialized views alongside one-pass, sublinear-space synopses for massive, high-velocity data. It normalized continuous queries, windowed operators, and in-situ filtering near archival stores, while advancing privacy-aware governance that enabled controlled cross-organization analytics.
• Enterprise analytics centered on data warehouses emphasized precomputation and schema design: conceptual multidimensional modeling and view selection to materialize aggregates, enabling predictable query performance over large integrated stores [1], [8], [13], [14].
• Sublinear-space approximation became a core analytic strategy for massive datasets and streams, using synopsis data structures and randomized sketches to estimate norms and query answers in one pass with controllable error [5], [6], [15], [17].
• Big Data shifted from batch mining to continuous monitoring, framing streams as first-class managed data with standing queries, online analytics, and windowed operators, supported by streaming algorithms for low-memory updates [3], [5], [17].
• Scalable knowledge discovery prioritized algorithmic redesign—parallelization, data partitioning, and index-aware computation—to mine large datasets: parallel decision-tree classification, grid-based clustering, fast principal component analysis, and distance-based outlier detection [4], [7], [9], [10], [11], [19].
• Systems work pushed computation toward the data via specialized middleware, archival storage interfaces, and access-control and privacy-aware data governance, enabling in-situ filtering of scientific archives and controlled access to large indexes and linked datasets [2], [16], [20].
Popular Keywords
Declarative Scale-Out MapReduce Analytics
2005 - 2011
Cloud-Native Big Data Analytics
2012 - 2017
Capability-Centric Edge-to-Cloud Analytics
2018 - 2024